
Make get_differential_vars type stable #2698


Open · wants to merge 5 commits into master
Conversation

@Ickaser (Contributor) commented May 5, 2025

Checklist

  • Appropriate tests were added
  • Any code changes were done in a way that does not break public API
  • All documentation related to code changes was updated
  • The new code follows the contributor guidelines, in particular the SciML Style Guide and COLPRAC.
  • Any new documentation only uses public API

Additional context

Solves #2594, based on #2594 (comment).

Question on testing

It seems like a good idea to add a test that would have caught #2594; my attempt would be to add @inferred in a couple of places. Would

@testset "Inplace: $(isinplace(_prob)), DAEProblem: $(_prob isa DAEProblem), BrownBasic: $(initalg isa BrownFullBasicInit), Autodiff: $autodiff" for _prob in [
prob_mm, prob_mm_oop],
initalg in [BrownFullBasicInit(), ShampineCollocationInit()], autodiff in [true, false]
alg = Rodas5P(; autodiff)
function f(p)
sol = solve(remake(_prob, p = p), alg, abstol = 1e-14,
reltol = 1e-14, initializealg = initalg)
sum(sol)
end
@test ForwardDiff.gradient(f, [0.04, 3e7, 1e4])[0, 0, 0] atol=1e-8
end

or
p = [0.04, 3e7, 1e4]
u₀ = [1.0, 0, 0]
du₀ = [-0.04, 0.04, 0.0]
tspan = (0.0, 100000.0)
differential_vars = [true, true, false]
# `f` here is the Robertson DAE residual from the existing dae_ad_tests
prob = DAEProblem(f, du₀, u₀, tspan, p, differential_vars = differential_vars)
prob_oop = DAEProblem{false}(f, du₀, u₀, tspan, p, differential_vars = differential_vars)
sol1 = solve(prob, DFBDF(), dt = 1e-5, abstol = 1e-8, reltol = 1e-8)
sol2 = solve(prob_oop, DFBDF(), dt = 1e-5, abstol = 1e-8, reltol = 1e-8)
# These tests flex differentiation of the solver and through the initialization.
# To only test the solver part and isolate potential issues, set the initialization to consistent.
@testset "Inplace: $(isinplace(_prob)), DAEProblem: $(_prob isa DAEProblem), BrownBasic: $(initalg isa BrownFullBasicInit), Autodiff: $autodiff" for _prob in [
        prob, prob_oop],
    initalg in [BrownFullBasicInit(), ShampineCollocationInit()],
    autodiff in [true, false]

    alg = DFBDF(; autodiff)
    function f(p)
        sol = solve(remake(_prob, p = p), alg, abstol = 1e-14,
            reltol = 1e-14, initializealg = initalg)
        sum(sol)
    end
    @test ForwardDiff.gradient(f, [0.04, 3e7, 1e4]) ≈ [0, 0, 0] atol=1e-8
end

be a good place for these? Should this get tested with all the implicit solvers where the mass-matrix DAE formulation is allowed?

@ChrisRackauckas (Member) commented:

> be a good place for these?

Yeah, the more the merrier.

https://github.com/SciML/OrdinaryDiffEq.jl/blob/master/test/interface/mass_matrix_tests.jl is probably a good test to slam some around.

@Ickaser (Contributor, Author) commented May 5, 2025

It turns out that lots of these are still not inferrable, even with this PR. Going from the Rosenbrock DAE AD tests, the change from

sol = solve(prob_mm, Rodas5P(), reltol = 1e-8, abstol = 1e-8)

to

using ADTypes: AutoForwardDiff
sol = @inferred solve(prob_mm, Rodas5P(autodiff=AutoForwardDiff(chunksize=3)), reltol = 1e-8, abstol = 1e-8)

already runs into an issue, since the return type of solve is still inferred as Any. I think there is something else going on here (or I don't know how to use @inferred properly, which is possible), but I can't tell what makes this different from the MWE I've been testing against.

That said, even though this PR apparently doesn't do enough for the existing test suite, it is already helping with the MWE and therefore probably my code base.

@ChrisRackauckas (Member) commented:

It's fine to sprinkle some @test_brokens around to get things in, and document them.
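
For reference, a hedged sketch of what such a broken-inference marker could look like (assuming `prob_mm` from the existing tests; @inferred throws when inference fails, and @test_broken records that exception as Broken instead of failing CI):

using Test, OrdinaryDiffEq
using ADTypes: AutoForwardDiff
# Sketch only: record the known inference failure without failing the suite.
# @inferred throws if solve's return type is not concretely inferred.
@test_broken begin
    @inferred solve(prob_mm, Rodas5P(autodiff = AutoForwardDiff(chunksize = 3)),
        reltol = 1e-8, abstol = 1e-8)
    true
end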

@Ickaser (Contributor, Author) commented May 6, 2025

OK, so the quirk in the test I was looking at turns out to be that for

M = Diagonal([1.0, 1.0, 0.0])
roberf = ODEFunction(rober, mass_matrix = M)

get_differential_vars(roberf, u0) is fully inferrable, since M was constructed as a Diagonal. But in the test, we had

M = [1.0 0 0
     0 1.0 0
     0 0 0]
roberf = ODEFunction(rober, mass_matrix = M)

With a full matrix, inference's best guess at get_differential_vars(roberf, u0) is Union{BitVector, DifferentialVarsUndefined}.
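
A minimal sketch of why this happens (my stand-ins, not OrdinaryDiffEq's actual code):

using LinearAlgebra
struct DifferentialVarsUndefined end  # stand-in for the package's marker type
# For a Diagonal mass matrix the result is always a BitVector, so the
# return type infers concretely:
differential_vars_sketch(M::Diagonal) = M.diag .!= 0
# For a generic matrix both branches are reachable at runtime, so the best
# inference can do is Union{BitVector, DifferentialVarsUndefined}:
differential_vars_sketch(M::AbstractMatrix) =
    isdiag(M) ? (diag(M) .!= 0) : DifferentialVarsUndefined()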

We currently get DifferentialVarsUndefined if all of the following are satisfied:

  • Matrix is not a UniformScaling
  • Matrix has at least one zero element
  • Matrix is either an AbstractSciMLOperator or not diagonal

So, for example, a matrix like [1 1 0; 0 1 0; 0 0 1] with one off-diagonal nonzero currently gives DifferentialVarsUndefined (the combined check is sketched below).
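
A paraphrase of the three bullets as a single predicate (my sketch, not the package's literal code):

using LinearAlgebra
# Paraphrase of the criteria above. The real check also treats any
# AbstractSciMLOperator mass matrix like the non-diagonal case.
differential_vars_undefined(M) =
    !(M isa UniformScaling) && any(iszero, M) && !isdiag(M)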
What should the exact criteria be for reporting the differential vars as undefined? My instinct is that even a fully zero matrix should still just give us falses(size(mass_matrix, 1)), so maybe this branch should not even exist?

@ChrisRackauckas (Member) commented:

> What should the exact criteria be for reporting the differential vars as undefined? My instinct is that even a fully zero matrix should still just give us falses(size(mass_matrix, 1)), so maybe this branch should not even exist?

I believe the proper definition is that the number of algebraic variables is determined by the dimension of the null space of M. If M is not diagonal, the algebraic variables are the e_i vectors which live in the null space? I think you can always orthogonalize a basis of the null space down to just being defined by linear combinations of the algebraic variables.

But I couldn't figure out how to calculate this easily, so the diagonal cases were handled, and any case where someone couldn't figure out the differential variables but needed them turned into an error. In lots of cases it's not necessary to calculate, so it ended up being an okay solution, with a few special cases like dense. But yes... I don't know; I think in general you probably need to QR factorize M, then take the Q part and check its null space?

julia> M = [1 1 0; 1 1 0; 0 0 0]
3×3 Matrix{Int64}:
 1  1  0
 1  1  0
 0  0  0

julia> qr(M).R
3×3 Matrix{Float64}:
 -1.41421  -1.41421  0.0
  0.0       0.0      0.0
  0.0       0.0      0.0

That definition isn't quite right either.
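
To make the mismatch concrete (a quick check using the example matrix above): the null space is two-dimensional, but only one coordinate vector actually lies in it, so "the e_i in the null space" undercounts the algebraic variables.

using LinearAlgebra
M = [1 1 0; 1 1 0; 0 0 0]
size(nullspace(float(M)), 2)        # 2: two algebraic directions by dimension
count(i -> iszero(M[:, i]), 1:3)    # 1: only e_3 satisfies M * e_i == 0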

@Ickaser (Contributor, Author) commented May 6, 2025

> But I couldn't figure out how to calculate this easily, so the diagonal cases were handled, and any case where someone couldn't figure out the differential variables but needed them turned into an error. In lots of cases it's not necessary to calculate, so it ended up being an okay solution, with a few special cases like dense.

OK, that makes sense. I had a sense my solution might be too easy; otherwise it would already have been there. I'll revert it to keep the previous behavior, I think.

For at least some of the tests, I can just change to explicitly using a Diagonal matrix, and do likewise in the examples to point people this way, plus maybe add a note on this type-stability question to the mass-matrix docs. That would be easy enough, and that way we avoid doing any possibly expensive linear algebra (e.g., for cases with larger mass matrices) in what should typically be a simple preparation step.
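
For concreteness, the construction the tests and docs would point people toward (assembled from the snippets above; `rober` is the Robertson RHS from the existing tests):

using LinearAlgebra, OrdinaryDiffEq
# Build the mass matrix as a Diagonal rather than a dense Matrix so that
# get_differential_vars infers a concrete BitVector.
M = Diagonal([1.0, 1.0, 0.0])                  # instead of a dense 3×3 Matrix
roberf = ODEFunction(rober, mass_matrix = M)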
